Hippocampal Formation Breaks Combinatorial Explosion for Reinforcement Learning: A Conjecture

نویسنده

  • András Lörincz
چکیده

There is surmounting evidence that reinforcement learning (RL) is a good model for the dopamine system of the brain and the prefrontal cortex. RL is also promising from the algorithmic point of view, because recent factored RL algorithms have favorable convergence and scaling properties and can counteract the curse of dimensionality problem, the major obstacle of practical applications of RL methods. Learning in navigation tasks then separates (i) to the search and the encoding of the factors, such as position, direction, and speed, and (ii) to the optimization of RL decision making by using these factors. We conjecture that the main task of the hippocampal formation is to separate factors and encode into neocortical areas the different lowdimensional conjunctive representations of them to suit factored RL value estimation. The mathematical framework is sketched. It includes convergent factored RL model and autoregressive (AR) hidden process model that finds factors including the hidden causes. The AR model is mapped to the hippocampal formation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The hippocampus and cerebellum in adaptively timed learning, recognition, and movement.

The concepts of declarative memory and procedural memory have been used to distinguish two basic types of learning. A neural network model suggests how such memory processes work together as recognition learning, reinforcement learning, and sensorimotor learning take place during adaptive behaviors. To coordinate these processes, the hippocampal formation and cerebellum each contains circuits t...

متن کامل

Looking for Scalable gents

Reinforcement Learning intends to ease and possibly to perform automatically the design of systems such as software or robot agents. An important aspect is the ability of learning agents to adapt to their environment and to the task they have to accomplish. This kind of learning is unfortunately restrained by problems like combinatorial explosion of the state space that limits the number of sen...

متن کامل

Learning to Weigh Basic Behaviors in Scalable gents

We are working on the use of Reinforcement Learning (RL)[3] algorithms to design automatically reactive situated agents limited to only local perceptions. Unfortunately, as good RL algorithms suffer from combinatorial explosion, their use is generally limited to simple problems. As shown on the tile-world example of figure 1, we propose to overcome these difficulties by making the hypothesis, a...

متن کامل

Joint Learning in Stochastic Games: Playing Coordination Games Within Coalitions

Despite the progress in multiagent reinforcement learning via formalisms based on stochastic games, these have difficulties coping with a high number of agents due to the combinatorial explosion in the number of joint actions. One possible way to reduce the complexity of the problem is to let agents form groups of limited size so that the number of the joint actions is reduced. This paper inves...

متن کامل

Assessment of the role of NMDA receptors located in hippocampal CA1 area on the effects of oral morphine dependency on spatial learning and memory in rat

Introduction: It has been reported that oral morphine dependency facilitated formation of spatial learning and memory. In the present study the role of NMDA receptors located in hippocampal CA1 area of morphine dependent rats was studied. Methods: Male rats were divided into 4 groups. Two cannulae were stereotaxically implanted bilaterally into the hippocampal CA1 area. After 5 days recover...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008